Overview

Dataset statistics

Number of variables29
Number of observations150000
Missing cells43492
Missing cells (%)1.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory40.0 MiB
Average record size in memory279.5 B

Variable types

NUM27
CAT1
BOOL1

Reproduction

Analysis started2020-03-23 13:19:23.578049
Analysis finished2020-03-23 14:37:17.135664
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
v_5 is highly correlated with v_2 and 1 other fieldsHigh Correlation
v_2 is highly correlated with v_5 and 1 other fieldsHigh Correlation
v_6 is highly correlated with v_1 and 1 other fieldsHigh Correlation
v_1 is highly correlated with v_6 and 1 other fieldsHigh Correlation
v_7 is highly correlated with v_2 and 1 other fieldsHigh Correlation
v_8 is highly correlated with v_3High Correlation
v_3 is highly correlated with v_8High Correlation
v_9 is highly correlated with v_4High Correlation
v_4 is highly correlated with v_9 and 1 other fieldsHigh Correlation
v_10 is highly correlated with v_1 and 1 other fieldsHigh Correlation
v_13 is highly correlated with v_4High Correlation
bodyType has 4506 (3.0%) missing values Missing
fuelType has 8680 (5.8%) missing values Missing
gearbox has 5981 (4.0%) missing values Missing
notRepairedDamage has 24324 (16.2%) missing values Missing
power is highly skewed (γ1 = 65.86317787) Skewed
creatDate is highly skewed (γ1 = -79.01331042) Skewed
model has 11762 (7.8%) zeros Zeros
brand has 31480 (21.0%) zeros Zeros
bodyType has 41420 (27.6%) zeros Zeros
fuelType has 91656 (61.1%) zeros Zeros
power has 12829 (8.6%) zeros Zeros
v_5 has 4485 (3.0%) zeros Zeros
v_6 has 35465 (23.6%) zeros Zeros
v_7 has 5467 (3.6%) zeros Zeros
v_8 has 1597 (1.1%) zeros Zeros
v_9 has 3486 (2.3%) zeros Zeros

Variables

SaleID
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count150000
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74999.5
Minimum0
Maximum149999
Zeros1
Zeros (%)< 0.1%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile7499.95
Q137499.75
median74999.5
Q3112499.25
95-th percentile142499.05
Maximum149999
Range149999
Interquartile range (IQR)74999.5

Descriptive statistics

Standard deviation43301.41453
Coefficient of variation (CV)0.5773560427
Kurtosis-1.2
Mean74999.5
Median Absolute Deviation (MAD)37500
Skewness0
Sum1.1249925e+10
Variance1875012500
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 149999.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
113949 1 < 0.1%
 
15661 1 < 0.1%
 
13612 1 < 0.1%
 
3371 1 < 0.1%
 
1322 1 < 0.1%
 
7465 1 < 0.1%
 
5416 1 < 0.1%
 
27943 1 < 0.1%
 
25894 1 < 0.1%
 
Other values (149990) 149990 > 99.9%
 
ValueCountFrequency (%) 
0 1 < 0.1%
 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
ValueCountFrequency (%) 
149999 1 < 0.1%
 
149998 1 < 0.1%
 
149997 1 < 0.1%
 
149996 1 < 0.1%
 
149995 1 < 0.1%
 

name
Real number (ℝ≥0)

Distinct count99662
Unique (%)66.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68349.17287
Minimum0
Maximum196812
Zeros4
Zeros (%)< 0.1%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile811
Q111156
median51638
Q3118841.25
95-th percentile180617.05
Maximum196812
Range196812
Interquartile range (IQR)107685.25

Descriptive statistics

Standard deviation61103.87509
Coefficient of variation (CV)0.8939958236
Kurtosis-1.039944629
Mean68349.17287
Median Absolute Deviation (MAD)53386.51797
Skewness0.5576057626
Sum1.025237593e+10
Variance3733683552
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000000e+00 5.000000e+00 6.500000e+00 9.500000e+00 1.100000e+01 ... 9.347150e+04 1.087515e+05 1.087525e+05 1.641490e+05 1.968120e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
708 282 0.2%
 
387 282 0.2%
 
55 280 0.2%
 
1541 263 0.2%
 
203 233 0.2%
 
53 221 0.1%
 
713 217 0.1%
 
290 197 0.1%
 
1186 184 0.1%
 
911 182 0.1%
 
Other values (99652) 147659 98.4%
 
ValueCountFrequency (%) 
0 4 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
6 35 < 0.1%
 
7 1 < 0.1%
 
ValueCountFrequency (%) 
196812 1 < 0.1%
 
196811 1 < 0.1%
 
196810 1 < 0.1%
 
196809 1 < 0.1%
 
196807 1 < 0.1%
 

regDate
Real number (ℝ≥0)

Distinct count3894
Unique (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20034170.51
Minimum19910001
Maximum20151212
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum19910001
5-th percentile19950209
Q119990912
median20030912
Q320071109
95-th percentile20120811
Maximum20151212
Range241211
Interquartile range (IQR)80197

Descriptive statistics

Standard deviation53649.87926
Coefficient of variation (CV)0.00267791867
Kurtosis-0.6973078263
Mean20034170.51
Median Absolute Deviation (MAD)44768.43009
Skewness0.02849507907
Sum3.005125577e+12
Variance2878309544
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[19910001. 19910011.5 19910101.5 19910111.5 19910202.5 ... 20151011.5 20151101.5 20151111.5 20151201.5 20151212. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
20000008 180 0.1%
 
20000011 158 0.1%
 
20000004 157 0.1%
 
20000010 157 0.1%
 
20000002 155 0.1%
 
20000009 154 0.1%
 
20000005 151 0.1%
 
20000001 149 0.1%
 
20000006 143 0.1%
 
20000007 142 0.1%
 
Other values (3884) 148454 99.0%
 
ValueCountFrequency (%) 
19910001 14 < 0.1%
 
19910002 14 < 0.1%
 
19910003 12 < 0.1%
 
19910004 11 < 0.1%
 
19910005 15 < 0.1%
 
ValueCountFrequency (%) 
20151212 1 < 0.1%
 
20151211 3 < 0.1%
 
20151210 5 < 0.1%
 
20151209 1 < 0.1%
 
20151208 4 < 0.1%
 

model
Real number (ℝ≥0)

ZEROS
Distinct count248
Unique (%)0.2%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean47.12902086
Minimum0
Maximum247
Zeros11762
Zeros (%)7.8%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0
Q110
median30
Q366
95-th percentile162
Maximum247
Range247
Interquartile range (IQR)56

Descriptive statistics

Standard deviation49.53603965
Coefficient of variation (CV)1.051072964
Kurtosis1.740483202
Mean47.12902086
Median Absolute Deviation (MAD)37.71013974
Skewness1.484387653
Sum7069306
Variance2453.819225
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 11762 7.8%
 
19 9573 6.4%
 
4 8445 5.6%
 
1 6038 4.0%
 
29 5186 3.5%
 
48 5052 3.4%
 
40 4502 3.0%
 
26 4496 3.0%
 
8 4391 2.9%
 
31 3827 2.6%
 
Other values (238) 86727 57.8%
 
ValueCountFrequency (%) 
0 11762 7.8%
 
1 6038 4.0%
 
2 286 0.2%
 
3 920 0.6%
 
4 8445 5.6%
 
ValueCountFrequency (%) 
247 1 < 0.1%
 
246 7 < 0.1%
 
245 2 < 0.1%
 
244 3 < 0.1%
 
243 4 < 0.1%
 

brand
Real number (ℝ≥0)

ZEROS
Distinct count40
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.052733333
Minimum0
Maximum39
Zeros31480
Zeros (%)21.0%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median6
Q313
95-th percentile25
Maximum39
Range39
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.864956341
Coefficient of variation (CV)0.9766815832
Kurtosis1.076201475
Mean8.052733333
Median Absolute Deviation (MAD)6.277367452
Skewness1.150760283
Sum1207910
Variance61.85753825
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 32.5 36.5 37.5 38.5 39. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 31480 21.0%
 
4 16737 11.2%
 
14 16089 10.7%
 
10 14249 9.5%
 
1 13794 9.2%
 
6 10217 6.8%
 
9 7306 4.9%
 
5 4665 3.1%
 
13 3817 2.5%
 
11 2945 2.0%
 
Other values (30) 28701 19.1%
 
ValueCountFrequency (%) 
0 31480 21.0%
 
1 13794 9.2%
 
2 321 0.2%
 
3 2461 1.6%
 
4 16737 11.2%
 
ValueCountFrequency (%) 
39 9 < 0.1%
 
38 65 < 0.1%
 
37 333 0.2%
 
36 228 0.2%
 
35 180 0.1%
 

bodyType
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count8
Unique (%)< 0.1%
Missing4506
Missing (%)3.0%
Infinite0
Infinite (%)0.0%
Mean1.792369445
Minimum0
Maximum7
Zeros41420
Zeros (%)27.6%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q33
95-th percentile6
Maximum7
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.760639503
Coefficient of variation (CV)0.9822972091
Kurtosis0.2069369316
Mean1.792369445
Median Absolute Deviation (MAD)1.404709438
Skewness0.991529939
Sum260779
Variance3.099851461
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 41420 27.6%
 
1 35272 23.5%
 
2 30324 20.2%
 
3 13491 9.0%
 
4 9609 6.4%
 
5 7607 5.1%
 
6 6482 4.3%
 
7 1289 0.9%
 
(Missing) 4506 3.0%
 
ValueCountFrequency (%) 
0 41420 27.6%
 
1 35272 23.5%
 
2 30324 20.2%
 
3 13491 9.0%
 
4 9609 6.4%
 
ValueCountFrequency (%) 
7 1289 0.9%
 
6 6482 4.3%
 
5 7607 5.1%
 
4 9609 6.4%
 
3 13491 9.0%
 

fuelType
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count7
Unique (%)< 0.1%
Missing8680
Missing (%)5.8%
Infinite0
Infinite (%)0.0%
Mean0.3758420606
Minimum0
Maximum6
Zeros91656
Zeros (%)61.1%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile1
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.5486766226
Coefficient of variation (CV)1.459859553
Kurtosis5.8800487
Mean0.3758420606
Median Absolute Deviation (MAD)0.4875202364
Skewness1.595485994
Sum53114
Variance0.3010460362
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 91656 61.1%
 
1 46991 31.3%
 
2 2212 1.5%
 
3 262 0.2%
 
4 118 0.1%
 
5 45 < 0.1%
 
6 36 < 0.1%
 
(Missing) 8680 5.8%
 
ValueCountFrequency (%) 
0 91656 61.1%
 
1 46991 31.3%
 
2 2212 1.5%
 
3 262 0.2%
 
4 118 0.1%
 
ValueCountFrequency (%) 
6 36 < 0.1%
 
5 45 < 0.1%
 
4 118 0.1%
 
3 262 0.2%
 
2 2212 1.5%
 

gearbox
Boolean

MISSING
Distinct count2
Unique (%)< 0.1%
Missing5981
Missing (%)4.0%
Memory size1.1 MiB
0
111623
1
32396
(Missing)
 
5981
ValueCountFrequency (%) 
0 111623 74.4%
 
1 32396 21.6%
 
(Missing) 5981 4.0%
 

power
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count566
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean119.3165467
Minimum0
Maximum19312
Zeros12829
Zeros (%)8.6%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0
Q175
median110
Q3150
95-th percentile232
Maximum19312
Range19312
Interquartile range (IQR)75

Descriptive statistics

Standard deviation177.1684192
Coefficient of variation (CV)1.484860433
Kurtosis5733.451054
Mean119.3165467
Median Absolute Deviation (MAD)53.74223049
Skewness65.86317787
Sum17897482
Variance31388.64875
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000e+00 5.0000e-01 4.5000e+00 5.5000e+00 3.2500e+01 ... 1.0140e+03 2.0945e+03 7.5115e+03 7.5365e+03 1.9312e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 12829 8.6%
 
75 9593 6.4%
 
150 6495 4.3%
 
60 6374 4.2%
 
140 5963 4.0%
 
101 5537 3.7%
 
116 5177 3.5%
 
90 4890 3.3%
 
170 4791 3.2%
 
105 4457 3.0%
 
Other values (556) 83894 55.9%
 
ValueCountFrequency (%) 
0 12829 8.6%
 
1 8 < 0.1%
 
2 3 < 0.1%
 
3 3 < 0.1%
 
4 9 < 0.1%
 
ValueCountFrequency (%) 
19312 1 < 0.1%
 
17932 1 < 0.1%
 
17700 1 < 0.1%
 
17410 1 < 0.1%
 
17322 1 < 0.1%
 

kilometer
Real number (ℝ≥0)

Distinct count13
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.59716
Minimum0.5
Maximum15
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum0.5
5-th percentile4
Q112.5
median15
Q315
95-th percentile15
Maximum15
Range14.5
Interquartile range (IQR)2.5

Descriptive statistics

Standard deviation3.919575532
Coefficient of variation (CV)0.3111475549
Kurtosis1.141934188
Mean12.59716
Median Absolute Deviation (MAD)3.103732409
Skewness-1.525921365
Sum1889574
Variance15.36307235
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0.5 0.75 1.5 3.5 4.5 ... 8.5 9.5 11.25 13.75 15. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
15 96877 64.6%
 
12.5 15722 10.5%
 
10 6459 4.3%
 
9 5257 3.5%
 
8 4573 3.0%
 
7 4084 2.7%
 
6 3725 2.5%
 
5 3144 2.1%
 
4 2718 1.8%
 
3 2501 1.7%
 
Other values (3) 4940 3.3%
 
ValueCountFrequency (%) 
0.5 1840 1.2%
 
1 746 0.5%
 
2 2354 1.6%
 
3 2501 1.7%
 
4 2718 1.8%
 
ValueCountFrequency (%) 
15 96877 64.6%
 
12.5 15722 10.5%
 
10 6459 4.3%
 
9 5257 3.5%
 
8 4573 3.0%
 

notRepairedDamage
Categorical

MISSING
Distinct count2
Unique (%)< 0.1%
Missing24324
Missing (%)16.2%
Memory size1.1 MiB
0.0
111361
1.0
 
14315
ValueCountFrequency (%) 
0.0 111361 74.2%
 
1.0 14315 9.5%
 
(Missing) 24324 16.2%
 

Length

Max length3
Mean length3
Min length3
ValueCountFrequency (%) 
Decimal_Number 2 40.0%
 
Lowercase_Letter 2 40.0%
 
Other_Punctuation 1 20.0%
 
ValueCountFrequency (%) 
Common 3 60.0%
 
Latin 2 40.0%
 
ValueCountFrequency (%) 
ASCII 5 100.0%
 

regionCode
Real number (ℝ≥0)

Distinct count7905
Unique (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2583.077267
Minimum0
Maximum8120
Zeros63
Zeros (%)< 0.1%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile188
Q11018
median2196
Q33843
95-th percentile6244.05
Maximum8120
Range8120
Interquartile range (IQR)2825

Descriptive statistics

Standard deviation1885.363218
Coefficient of variation (CV)0.7298903685
Kurtosis-0.3408317795
Mean2583.077267
Median Absolute Deviation (MAD)1560.954126
Skewness0.6888811779
Sum387461590
Variance3554594.464
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000e+00 5.0000e-01 3.5000e+00 6.5000e+00 9.5000e+00 ... 7.3645e+03 7.4715e+03 7.8045e+03 7.9785e+03 8.1200e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
419 369 0.2%
 
764 258 0.2%
 
125 137 0.1%
 
176 136 0.1%
 
462 134 0.1%
 
428 132 0.1%
 
24 130 0.1%
 
1184 130 0.1%
 
122 129 0.1%
 
828 126 0.1%
 
Other values (7895) 148319 98.9%
 
ValueCountFrequency (%) 
0 63 < 0.1%
 
1 17 < 0.1%
 
2 26 < 0.1%
 
3 30 < 0.1%
 
4 65 < 0.1%
 
ValueCountFrequency (%) 
8120 1 < 0.1%
 
8117 1 < 0.1%
 
8113 1 < 0.1%
 
8112 1 < 0.1%
 
8109 1 < 0.1%
 

creatDate
Real number (ℝ≥0)

SKEWED
Distinct count96
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20160330.79
Minimum20150618
Maximum20160407
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum20150618
5-th percentile20160307
Q120160313
median20160321
Q320160329
95-th percentile20160404
Maximum20160407
Range9789
Interquartile range (IQR)16

Descriptive statistics

Standard deviation106.7328088
Coefficient of variation (CV)5.294199283e-06
Kurtosis6881.080328
Mean20160330.79
Median Absolute Deviation (MAD)23.39535659
Skewness-79.01331042
Sum3.024049619e+12
Variance11391.89248
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[20150618. 20151110. 20151222. 20160104.5 20160130.5 ... 20160401.5 20160402.5 20160404.5 20160405.5 20160407. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
20160403 5848 3.9%
 
20160404 5606 3.7%
 
20160320 5485 3.7%
 
20160312 5383 3.6%
 
20160402 5382 3.6%
 
20160321 5361 3.6%
 
20160314 5278 3.5%
 
20160328 5218 3.5%
 
20160319 5213 3.5%
 
20160307 5203 3.5%
 
Other values (86) 96023 64.0%
 
ValueCountFrequency (%) 
20150618 1 < 0.1%
 
20150807 1 < 0.1%
 
20150810 1 < 0.1%
 
20150904 2 < 0.1%
 
20150909 1 < 0.1%
 
ValueCountFrequency (%) 
20160407 223 0.1%
 
20160406 505 0.3%
 
20160405 1688 1.1%
 
20160404 5606 3.7%
 
20160403 5848 3.9%
 

price
Real number (ℝ≥0)

Distinct count3763
Unique (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5923.327333
Minimum11
Maximum99999
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum11
5-th percentile400
Q11300
median3250
Q37700
95-th percentile19970
Maximum99999
Range99988
Interquartile range (IQR)6400

Descriptive statistics

Standard deviation7501.998477
Coefficient of variation (CV)1.266517627
Kurtosis18.99518336
Mean5923.327333
Median Absolute Deviation (MAD)4995.003947
Skewness3.346486763
Sum888499100
Variance56279981.14
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.10000e+01 4.95000e+01 5.25000e+01 5.95000e+01 7.95000e+01 ... 8.49940e+04 8.49995e+04 8.94750e+04 9.99945e+04 9.99990e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
500 2337 1.6%
 
1500 2158 1.4%
 
1200 1922 1.3%
 
1000 1850 1.2%
 
2500 1821 1.2%
 
600 1535 1.0%
 
3500 1533 1.0%
 
800 1513 1.0%
 
2000 1378 0.9%
 
999 1356 0.9%
 
Other values (3753) 132597 88.4%
 
ValueCountFrequency (%) 
11 2 < 0.1%
 
12 3 < 0.1%
 
13 6 < 0.1%
 
14 1 < 0.1%
 
15 13 < 0.1%
 
ValueCountFrequency (%) 
99999 5 < 0.1%
 
99990 1 < 0.1%
 
99900 1 < 0.1%
 
98430 1 < 0.1%
 
98000 2 < 0.1%
 

v_0
Real number (ℝ≥0)

Distinct count143997
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44.40626753
Minimum30.45197649
Maximum52.30417826
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum30.45197649
5-th percentile40.9957098
Q143.13579888
median44.61026572
Q346.0047209
95-th percentile47.74725825
Maximum52.30417826
Range21.85220177
Interquartile range (IQR)2.868922015

Descriptive statistics

Standard deviation2.457547906
Coefficient of variation (CV)0.05534236591
Kurtosis3.993841049
Mean44.40626753
Median Absolute Deviation (MAD)1.794836875
Skewness-1.316712325
Sum6660940.13
Variance6.039541711
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[30.45197649 31.48446762 32.33477729 33.17923568 33.99008494 ... 49.92855621 50.45940504 50.90603274 51.3507101 52.30417826], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
45.34911469 20 < 0.1%
 
48.0872165 16 < 0.1%
 
47.56845031 15 < 0.1%
 
48.61814962 15 < 0.1%
 
47.84035689 15 < 0.1%
 
48.25143023 15 < 0.1%
 
48.43343972 14 < 0.1%
 
47.80298905 14 < 0.1%
 
48.26590733 14 < 0.1%
 
47.66644402 12 < 0.1%
 
Other values (143987) 149850 99.9%
 
ValueCountFrequency (%) 
30.45197649 1 < 0.1%
 
30.6073942 1 < 0.1%
 
30.82789938 1 < 0.1%
 
30.8293016 1 < 0.1%
 
31.00220029 1 < 0.1%
 
ValueCountFrequency (%) 
52.30417826 1 < 0.1%
 
51.88151656 1 < 0.1%
 
51.81542136 1 < 0.1%
 
51.71244476 1 < 0.1%
 
51.70440641 1 < 0.1%
 

v_1
Real number (ℝ)

HIGH CORRELATION
Distinct count143998
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.04480912301
Minimum-4.295588903
Maximum7.320308375
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum-4.295588903
5-th percentile-3.315105571
Q1-3.192349286
median-3.052671416
Q34.000669795
95-th percentile5.156493473
Maximum7.320308375
Range11.61589728
Interquartile range (IQR)7.193019081

Descriptive statistics

Standard deviation3.641893018
Coefficient of variation (CV)-81.27570398
Kurtosis-1.753016996
Mean-0.04480912301
Median Absolute Deviation (MAD)3.537543317
Skewness0.3594542872
Sum-6721.368452
Variance13.26338475
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-4.2955889 -4.11414196 -3.99357171 -3.84929112 -3.78423635 ... 6.02246016 6.32061473 6.38914498 6.84698975 7.32030837], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-3.245133027 20 < 0.1%
 
3.183322711 16 < 0.1%
 
1.942731923 15 < 0.1%
 
3.354949272 15 < 0.1%
 
2.796738683 15 < 0.1%
 
3.116687784 15 < 0.1%
 
2.886858389 14 < 0.1%
 
3.396243998 14 < 0.1%
 
3.195363167 14 < 0.1%
 
3.400294632 12 < 0.1%
 
Other values (143988) 149850 99.9%
 
ValueCountFrequency (%) 
-4.295588903 1 < 0.1%
 
-4.236904217 1 < 0.1%
 
-4.200243504 1 < 0.1%
 
-4.169894283 1 < 0.1%
 
-4.11464681 1 < 0.1%
 
ValueCountFrequency (%) 
7.320308375 1 < 0.1%
 
7.299601831 1 < 0.1%
 
7.270447755 1 < 0.1%
 
7.251385223 1 < 0.1%
 
7.189598872 1 < 0.1%
 

v_2
Real number (ℝ)

HIGH CORRELATION
Distinct count143997
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.08076505845
Minimum-4.47067143
Maximum19.0354965
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum-4.47067143
5-th percentile-1.78064028
Q1-0.9706712005
median-0.3829468904
Q30.2413348522
95-th percentile1.228078873
Maximum19.0354965
Range23.50616793
Interquartile range (IQR)1.212006053

Descriptive statistics

Standard deviation2.929617945
Coefficient of variation (CV)36.27333405
Kurtosis23.86059102
Mean0.08076505845
Median Absolute Deviation (MAD)1.241902009
Skewness4.842555904
Sum12114.75877
Variance8.582661304
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-4.47067143 -3.4740065 -3.17110227 -2.88234442 -2.70919733 ... 16.91136898 17.20052575 17.64244173 18.1843126 19.0354965 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-0.3498599506 20 < 0.1%
 
0.8265767659 16 < 0.1%
 
0.8877624977 15 < 0.1%
 
-0.1583641418 15 < 0.1%
 
0.9806164814 15 < 0.1%
 
0.0307076025 15 < 0.1%
 
-0.0281644461 14 < 0.1%
 
-0.1289230628 14 < 0.1%
 
0.9858723586 14 < 0.1%
 
0.470568901 12 < 0.1%
 
Other values (143987) 149850 99.9%
 
ValueCountFrequency (%) 
-4.47067143 1 < 0.1%
 
-4.155742905 1 < 0.1%
 
-4.022534299 1 < 0.1%
 
-3.741392553 1 < 0.1%
 
-3.737970421 2 < 0.1%
 
ValueCountFrequency (%) 
19.0354965 1 < 0.1%
 
18.99430891 1 < 0.1%
 
18.80211771 1 < 0.1%
 
18.67641395 1 < 0.1%
 
18.65675078 1 < 0.1%
 

v_3
Real number (ℝ)

HIGH CORRELATION
Distinct count143998
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.07883342346
Minimum-7.275036707
Maximum9.854701534
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum-7.275036707
5-th percentile-3.173722911
Q1-1.462580044
median0.09972198465
Q31.565838202
95-th percentile3.34599903
Maximum9.854701534
Range17.12973824
Interquartile range (IQR)3.028418246

Descriptive statistics

Standard deviation2.026514036
Coefficient of variation (CV)25.70627973
Kurtosis-0.4180058877
Mean0.07883342346
Median Absolute Deviation (MAD)1.676853293
Skewness0.1062920405
Sum11825.01352
Variance4.106759138
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-7.27503671 -5.43567292 -5.09078386 -4.75762091 -4.51404742 ... 5.48296103 6.14544317 6.88336337 8.47000281 9.85470153], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-0.2182005123 20 < 0.1%
 
-1.312989982 16 < 0.1%
 
-2.006612314 15 < 0.1%
 
-1.619872925 15 < 0.1%
 
-1.61243245 15 < 0.1%
 
-1.382756053 15 < 0.1%
 
-1.881849681 14 < 0.1%
 
-1.713509257 14 < 0.1%
 
-1.690953035 14 < 0.1%
 
-1.199487739 12 < 0.1%
 
Other values (143988) 149850 99.9%
 
ValueCountFrequency (%) 
-7.275036707 1 < 0.1%
 
-5.817029459 1 < 0.1%
 
-5.50069172 1 < 0.1%
 
-5.455090672 1 < 0.1%
 
-5.416255161 1 < 0.1%
 
ValueCountFrequency (%) 
9.854701534 1 < 0.1%
 
9.369226864 1 < 0.1%
 
9.181525054 1 < 0.1%
 
9.121567529 1 < 0.1%
 
9.102653962 1 < 0.1%
 

v_4
Real number (ℝ)

HIGH CORRELATION
Distinct count143998
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.01787461475
Minimum-4.364565242
Maximum6.82935164
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum-4.364565242
5-th percentile-1.694681469
Q1-0.9211914838
median-0.07591042907
Q30.8687584354
95-th percentile2.100814569
Maximum6.82935164
Range11.19391688
Interquartile range (IQR)1.789949919

Descriptive statistics

Standard deviation1.193661387
Coefficient of variation (CV)66.77969868
Kurtosis-0.1972952432
Mean0.01787461475
Median Absolute Deviation (MAD)0.9851313333
Skewness0.3679889702
Sum2681.192212
Variance1.424827506
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-4.36456524 -3.89893376 -2.95924778 -2.60864082 -2.42520141 ... 3.70756539 4.2269578 4.50015967 4.95728189 6.82935164], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-1.626828233 20 < 0.1%
 
0.6967751761 16 < 0.1%
 
-0.3658995025 15 < 0.1%
 
-0.3649862222 15 < 0.1%
 
-0.3798153928 15 < 0.1%
 
1.23545786 15 < 0.1%
 
1.617135927 14 < 0.1%
 
-0.3048146874 14 < 0.1%
 
-0.3932832081 14 < 0.1%
 
-0.2910286045 12 < 0.1%
 
Other values (143988) 149850 99.9%
 
ValueCountFrequency (%) 
-4.364565242 1 < 0.1%
 
-4.233625966 1 < 0.1%
 
-4.211500995 1 < 0.1%
 
-4.156168164 1 < 0.1%
 
-4.095555032 1 < 0.1%
 
ValueCountFrequency (%) 
6.82935164 1 < 0.1%
 
4.95909398 1 < 0.1%
 
4.955469803 1 < 0.1%
 
4.87727069 1 < 0.1%
 
4.846567074 1 < 0.1%
 

v_5
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count139624
Unique (%)93.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2482035284
Minimum0
Maximum0.2918381131
Zeros4485
Zeros (%)3.0%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0.2270593411
Q10.2436153531
median0.2577979663
Q30.2652972593
95-th percentile0.2779034901
Maximum0.2918381131
Range0.2918381131
Interquartile range (IQR)0.02168190624

Descriptive statistics

Standard deviation0.04580397102
Coefficient of variation (CV)0.1845419818
Kurtosis22.93408106
Mean0.2482035284
Median Absolute Deviation (MAD)0.02064245697
Skewness-4.737093909
Sum37230.52926
Variance0.002098003761
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 5.91495975e-06 8.71099067e-04 2.04117919e-01 2.08683505e-01 ... 2.85179253e-01 2.86026893e-01 2.87436368e-01 2.91008387e-01 2.91838113e-01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4485 3.0%
 
0.2694058132 20 < 0.1%
 
0.2562264318 16 < 0.1%
 
0.2610820816 15 < 0.1%
 
0.2770966197 15 < 0.1%
 
0.2609957841 15 < 0.1%
 
0.2657638553 15 < 0.1%
 
0.263125818 14 < 0.1%
 
0.2606313746 14 < 0.1%
 
0.2773974215 14 < 0.1%
 
Other values (139614) 145377 96.9%
 
ValueCountFrequency (%) 
0 4485 3.0%
 
1.182991951e-05 1 < 0.1%
 
3.541627639e-05 1 < 0.1%
 
4.983770368e-05 1 < 0.1%
 
9.987459124e-05 1 < 0.1%
 
ValueCountFrequency (%) 
0.2918381131 1 < 0.1%
 
0.2916221047 1 < 0.1%
 
0.291473069 1 < 0.1%
 
0.2914312843 1 < 0.1%
 
0.291293837 1 < 0.1%
 

v_6
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count109766
Unique (%)73.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04492300431
Minimum0
Maximum0.1514195959
Zeros35465
Zeros (%)23.6%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0
Q13.811099819e-05
median0.0008120586185
Q30.1020092977
95-th percentile0.119640585
Maximum0.1514195959
Range0.1514195959
Interquartile range (IQR)0.1019711867

Descriptive statistics

Standard deviation0.05174278749
Coefficient of variation (CV)1.151810488
Kurtosis-1.742566654
Mean0.04492300431
Median Absolute Deviation (MAD)0.05022600402
Skewness0.3680730421
Sum6738.450646
Variance0.002677316057
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 7.49451978e-09 5.28032345e-05 5.28289684e-05 3.97418565e-04 ... 1.37468348e-01 1.39768756e-01 1.46754618e-01 1.47006910e-01 1.51419596e-01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 35465 23.6%
 
5.281627533e-05 20 < 0.1%
 
0.09347230563 16 < 0.1%
 
0.07521186781 15 < 0.1%
 
0.08756163723 15 < 0.1%
 
0.09502883385 15 < 0.1%
 
0.09188745529 15 < 0.1%
 
0.09545229844 14 < 0.1%
 
0.08870361553 14 < 0.1%
 
0.09285305161 14 < 0.1%
 
Other values (109756) 114397 76.3%
 
ValueCountFrequency (%) 
0 35465 23.6%
 
1.498903956e-08 1 < 0.1%
 
1.855559964e-08 1 < 0.1%
 
3.458416163e-08 1 < 0.1%
 
5.365260697e-08 1 < 0.1%
 
ValueCountFrequency (%) 
0.1514195959 1 < 0.1%
 
0.1511057713 1 < 0.1%
 
0.1503892133 1 < 0.1%
 
0.1503220462 1 < 0.1%
 
0.1499101503 1 < 0.1%
 

v_7
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count138709
Unique (%)92.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1246924611
Minimum0
Maximum1.404936375
Zeros5467
Zeros (%)3.6%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0.007337314046
Q10.06247353269
median0.09586589827
Q30.1252429449
95-th percentile0.1681911025
Maximum1.404936375
Range1.404936375
Interquartile range (IQR)0.06276941225

Descriptive statistics

Standard deviation0.2014095303
Coefficient of variation (CV)1.615250261
Kurtosis25.84548929
Mean0.1246924611
Median Absolute Deviation (MAD)0.07589027857
Skewness5.130233019
Sum18703.86917
Variance0.04056579891
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 4.39435175e-07 1.01849909e-04 1.04205127e-02 1.47925155e-02 ... 1.31277095e+00 1.32231307e+00 1.33232577e+00 1.34979951e+00 1.40493638e+00], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 5467 3.6%
 
0.1242125722 20 < 0.1%
 
0.1302719182 16 < 0.1%
 
0.139667344 15 < 0.1%
 
0.05167975228 15 < 0.1%
 
0.08159521024 15 < 0.1%
 
0.1409074869 15 < 0.1%
 
0.1408648003 14 < 0.1%
 
0.07292957058 14 < 0.1%
 
0.0451710962 14 < 0.1%
 
Other values (138699) 144395 96.3%
 
ValueCountFrequency (%) 
0 5467 3.6%
 
8.788703506e-07 1 < 0.1%
 
4.967235703e-06 1 < 0.1%
 
5.313628437e-06 1 < 0.1%
 
7.115923657e-06 1 < 0.1%
 
ValueCountFrequency (%) 
1.404936375 1 < 0.1%
 
1.401999012 1 < 0.1%
 
1.387846594 1 < 0.1%
 
1.385366787 1 < 0.1%
 
1.372934179 1 < 0.1%
 

v_8
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count142451
Unique (%)95.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.05814385476
Minimum0
Maximum0.1607909853
Zeros1597
Zeros (%)1.1%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0.01353156407
Q10.03533368675
median0.05701359764
Q30.07938157081
95-th percentile0.1087993996
Maximum0.1607909853
Range0.1607909853
Interquartile range (IQR)0.04404788406

Descriptive statistics

Standard deviation0.02918575568
Coefficient of variation (CV)0.5019577013
Kurtosis-0.6362252576
Mean0.05814385476
Median Absolute Deviation (MAD)0.02418387461
Skewness0.204613257
Sum8721.578214
Variance0.0008518083346
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 4.61464116e-07 3.54723847e-03 8.51163176e-03 1.16955203e-02 ... 1.30983090e-01 1.35642177e-01 1.41363694e-01 1.48809588e-01 1.60790985e-01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1597 1.1%
 
0.06735800888 20 < 0.1%
 
0.07474168883 16 < 0.1%
 
0.0732681433 15 < 0.1%
 
0.07590468059 15 < 0.1%
 
0.07666355645 15 < 0.1%
 
0.08469556352 15 < 0.1%
 
0.07865835259 14 < 0.1%
 
0.08058404544 14 < 0.1%
 
0.07537557616 14 < 0.1%
 
Other values (142441) 148265 98.8%
 
ValueCountFrequency (%) 
0 1597 1.1%
 
9.229282312e-07 1 < 0.1%
 
2.029666399e-06 1 < 0.1%
 
4.413213728e-06 1 < 0.1%
 
6.403629719e-06 1 < 0.1%
 
ValueCountFrequency (%) 
0.1607909853 1 < 0.1%
 
0.1597096024 1 < 0.1%
 
0.1577662813 1 < 0.1%
 
0.1555761419 1 < 0.1%
 
0.1538494068 1 < 0.1%
 

v_9
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count140617
Unique (%)93.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0619958947
Minimum0
Maximum0.2227874876
Zeros3486
Zeros (%)2.3%
Memory size1.1 MiB

Quantile statistics

Minimum0
5-th percentile0.009496496083
Q10.03393017697
median0.05848366702
Q30.08749054835
95-th percentile0.1248414393
Maximum0.2227874876
Range0.2227874876
Interquartile range (IQR)0.05356037137

Descriptive statistics

Standard deviation0.03569197873
Coefficient of variation (CV)0.575715197
Kurtosis-0.3214911789
Mean0.0619958947
Median Absolute Deviation (MAD)0.02964241053
Skewness0.4195007497
Sum9299.384205
Variance0.001273917346
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 6.35997285e-07 2.20420317e-03 5.18913701e-03 7.77194124e-03 ... 1.63615388e-01 1.70953420e-01 1.83589221e-01 1.95963968e-01 2.22787488e-01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 3486 2.3%
 
0.01486672915 20 < 0.1%
 
0.08276476968 16 < 0.1%
 
0.1011497378 15 < 0.1%
 
0.05153510167 15 < 0.1%
 
0.04587739606 15 < 0.1%
 
0.04676530325 15 < 0.1%
 
0.04961071498 14 < 0.1%
 
0.04820274816 14 < 0.1%
 
0.11088657 14 < 0.1%
 
Other values (140607) 146376 97.6%
 
ValueCountFrequency (%) 
0 3486 2.3%
 
1.271994569e-06 1 < 0.1%
 
3.552864466e-06 1 < 0.1%
 
5.529589418e-06 1 < 0.1%
 
7.708846052e-06 1 < 0.1%
 
ValueCountFrequency (%) 
0.2227874876 1 < 0.1%
 
0.2136169591 1 < 0.1%
 
0.2117691737 1 < 0.1%
 
0.211074674 1 < 0.1%
 
0.2104021627 1 < 0.1%
 

v_10
Real number (ℝ)

HIGH CORRELATION
Distinct count143997
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.001000238809
Minimum-9.16819241
Maximum12.35701062
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum-9.16819241
5-th percentile-5.65890342
Q1-3.72230288
median1.624076331
Q32.844356776
95-th percentile4.056172056
Maximum12.35701062
Range21.52520303
Interquartile range (IQR)6.566659656

Descriptive statistics

Standard deviation3.772386394
Coefficient of variation (CV)-3771.485731
Kurtosis-0.5779350635
Mean-0.001000238809
Median Absolute Deviation (MAD)3.369765156
Skewness0.02522046412
Sum-150.0358213
Variance14.23089911
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-9.16819241 -8.24190528 -7.56197726 -7.3953947 -6.75885393 ... 11.05498045 11.36691185 11.52950763 11.85815972 12.35701062], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2.329386406 20 < 0.1%
 
-4.303480552 16 < 0.1%
 
-3.163235777 15 < 0.1%
 
-4.757358503 15 < 0.1%
 
-4.383928893 15 < 0.1%
 
-4.047399926 15 < 0.1%
 
-4.686442272 14 < 0.1%
 
-3.950857205 14 < 0.1%
 
-4.815884497 14 < 0.1%
 
-3.877387325 12 < 0.1%
 
Other values (143987) 149850 99.9%
 
ValueCountFrequency (%) 
-9.16819241 1 < 0.1%
 
-9.109525099 1 < 0.1%
 
-8.798809873 1 < 0.1%
 
-8.776898545 1 < 0.1%
 
-8.563886287 1 < 0.1%
 
ValueCountFrequency (%) 
12.35701062 1 < 0.1%
 
12.3193027 1 < 0.1%
 
12.28529882 1 < 0.1%
 
12.18050295 1 < 0.1%
 
12.16900107 1 < 0.1%
 

v_11
Real number (ℝ)

Distinct count143997
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.009034543473
Minimum-5.558206704
Maximum18.81904247
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum-5.558206704
5-th percentile-3.373762442
Q1-1.951543007
median-0.3580526972
Q31.255021657
95-th percentile2.840711833
Maximum18.81904247
Range24.37724917
Interquartile range (IQR)3.206564665

Descriptive statistics

Standard deviation3.286071221
Coefficient of variation (CV)363.7229962
Kurtosis12.56873147
Mean0.009034543473
Median Absolute Deviation (MAD)2.053576657
Skewness3.029145813
Sum1355.181521
Variance10.79826407
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-5.5582067 -5.16246721 -4.80694188 -4.61120433 -4.39912512 ... 16.39412831 17.56993825 18.32770803 18.64558553 18.81904247], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-2.255590644 20 < 0.1%
 
-0.3300530272 16 < 0.1%
 
-1.107940254 15 < 0.1%
 
-0.8026135578 15 < 0.1%
 
-1.436493608 15 < 0.1%
 
-0.07987170397 15 < 0.1%
 
-1.229805655 14 < 0.1%
 
-1.124892465 14 < 0.1%
 
-0.3059463708 14 < 0.1%
 
0.09586985004 12 < 0.1%
 
Other values (143987) 149850 99.9%
 
ValueCountFrequency (%) 
-5.558206704 1 < 0.1%
 
-5.403044225 1 < 0.1%
 
-5.391154102 1 < 0.1%
 
-5.366580189 1 < 0.1%
 
-5.362860991 1 < 0.1%
 
ValueCountFrequency (%) 
18.81904247 1 < 0.1%
 
18.80207184 1 < 0.1%
 
18.80121836 1 < 0.1%
 
18.7871018 1 < 0.1%
 
18.76544301 1 < 0.1%
 

v_12
Real number (ℝ)

Distinct count143997
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.004812595252
Minimum-9.639552114
Maximum13.84779152
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum-9.639552114
5-th percentile-3.790116166
Q1-1.871845761
median-0.1307533175
Q31.776932949
95-th percentile4.100404318
Maximum13.84779152
Range23.48734364
Interquartile range (IQR)3.64877871

Descriptive statistics

Standard deviation2.517477676
Coefficient of variation (CV)523.1018908
Kurtosis0.2689373892
Mean0.004812595252
Median Absolute Deviation (MAD)2.045387974
Skewness0.3653576011
Sum721.8892878
Variance6.33769385
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-9.63955211 -7.65617982 -7.24172116 -6.28599508 -5.97261324 ... 7.08087535 9.20432382 11.01967833 12.2537301 13.84779152], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.8474330653 20 < 0.1%
 
2.486296944 16 < 0.1%
 
2.375255682 15 < 0.1%
 
3.097962759 15 < 0.1%
 
2.10447048 15 < 0.1%
 
2.515776809 15 < 0.1%
 
2.76974006 14 < 0.1%
 
2.387382343 14 < 0.1%
 
2.260273184 14 < 0.1%
 
2.291053577 12 < 0.1%
 
Other values (143987) 149850 99.9%
 
ValueCountFrequency (%) 
-9.639552114 1 < 0.1%
 
-9.404105892 1 < 0.1%
 
-9.288895057 1 < 0.1%
 
-9.223992577 1 < 0.1%
 
-8.679290001 1 < 0.1%
 
ValueCountFrequency (%) 
13.84779152 1 < 0.1%
 
13.56201137 1 < 0.1%
 
13.11304284 1 < 0.1%
 
13.08366141 1 < 0.1%
 
12.97305681 1 < 0.1%
 

v_13
Real number (ℝ)

HIGH CORRELATION
Distinct count143998
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0003126119439
Minimum-4.153898796
Maximum11.14766861
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum-4.153898796
5-th percentile-1.931083984
Q1-1.057788984
median-0.03624460356
Q30.9428130826
95-th percentile2.187794499
Maximum11.14766861
Range15.30156741
Interquartile range (IQR)2.000602067

Descriptive statistics

Standard deviation1.288987639
Coefficient of variation (CV)4123.283402
Kurtosis-0.4382740227
Mean0.0003126119439
Median Absolute Deviation (MAD)1.071740641
Skewness0.2679151914
Sum46.89179159
Variance1.661489135
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-4.1538988 -4.07352205 -4.0006869 -3.76113232 -3.69269149 ... 3.2658455 3.54690774 3.87934689 5.24966096 11.14766861], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-1.698496921 20 < 0.1%
 
-0.04346267135 16 < 0.1%
 
0.548696614 15 < 0.1%
 
-0.8346971591 15 < 0.1%
 
-0.499452746 15 < 0.1%
 
-0.3678552551 15 < 0.1%
 
-0.6628001058 14 < 0.1%
 
-0.3759573692 14 < 0.1%
 
1.037323962 14 < 0.1%
 
-0.4777935057 12 < 0.1%
 
Other values (143988) 149850 99.9%
 
ValueCountFrequency (%) 
-4.153898796 1 < 0.1%
 
-4.102768496 1 < 0.1%
 
-4.074922408 1 < 0.1%
 
-4.0721217 1 < 0.1%
 
-4.069742029 1 < 0.1%
 
ValueCountFrequency (%) 
11.14766861 1 < 0.1%
 
5.249749785 1 < 0.1%
 
5.249572137 1 < 0.1%
 
5.23233722 1 < 0.1%
 
5.228602135 1 < 0.1%
 

v_14
Real number (ℝ)

Distinct count143998
Unique (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.0006882314377
Minimum-6.546555965
Maximum8.658417877
Zeros0
Zeros (%)0.0%
Memory size1.1 MiB

Quantile statistics

Minimum-6.546555965
5-th percentile-2.109160871
Q1-0.4370336682
median0.1412459925
Q30.6803780745
95-th percentile1.365639579
Maximum8.658417877
Range15.20497384
Interquartile range (IQR)1.117411743

Descriptive statistics

Standard deviation1.038685151
Coefficient of variation (CV)-1509.209104
Kurtosis2.393525934
Mean-0.0006882314377
Median Absolute Deviation (MAD)0.7626297077
Skewness-1.186355325
Sum-103.2347157
Variance1.078866844
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-6.54655597 -6.12400021 -5.84279922 -4.85324552 -4.63320633 ... 2.24389309 2.40049531 2.60356717 2.7367176 8.65841788], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.003014936599 20 < 0.1%
 
-2.290343715 16 < 0.1%
 
-3.059444119 15 < 0.1%
 
1.02748673 15 < 0.1%
 
0.8695861711 15 < 0.1%
 
-1.330412484 15 < 0.1%
 
-0.5508258348 14 < 0.1%
 
-2.488781485 14 < 0.1%
 
0.9797061334 14 < 0.1%
 
1.162447648 12 < 0.1%
 
Other values (143988) 149850 99.9%
 
ValueCountFrequency (%) 
-6.546555965 1 < 0.1%
 
-6.124708498 1 < 0.1%
 
-6.123291914 1 < 0.1%
 
-6.113291304 1 < 0.1%
 
-6.066107398 1 < 0.1%
 
ValueCountFrequency (%) 
8.658417877 1 < 0.1%
 
2.743992717 1 < 0.1%
 
2.729442487 1 < 0.1%
 
2.697364726 1 < 0.1%
 
2.628865462 1 < 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

SaleIDnameregDatemodelbrandbodyTypefuelTypegearboxpowerkilometernotRepairedDamageregionCodecreatDatepricev_0v_1v_2v_3v_4v_5v_6v_7v_8v_9v_10v_11v_12v_13v_14
007362004040230.061.00.00.06012.50.0104620160404185043.3577963.9663440.0502572.1597441.1437860.2356760.1019880.1295490.0228160.097462-2.8818032.804097-2.4208210.7952920.914762
1122622003030140.012.00.00.0015.0NaN436620160309360045.3052735.2361120.1379251.380657-1.4221650.2647770.1210040.1357310.0265970.020582-4.9004822.096338-1.030483-1.7226740.245522
221487420040403115.0151.00.00.016312.50.0280620160402622245.9783594.8237921.319524-0.998467-0.9969110.2514100.1149120.1651470.0621730.027075-4.8467491.8035591.565330-0.832687-0.229963
337186519960908109.0100.00.01.019315.00.043420160312240045.6874784.492574-0.0506160.883600-2.2280790.2742930.1103000.1219640.0333950.000000-4.5095991.285940-0.501868-2.438353-0.478699
4411108020120103110.051.00.00.0685.00.0697720160313520044.3835112.0314330.572169-1.5712392.2460880.2280360.0732050.0918800.0788190.121534-1.8962400.9107830.9311102.8345181.923482
551376422009060224.0100.01.00.010910.00.0369020160319800046.323165-3.2292850.156615-1.727217-0.3456900.2602460.0005180.1198380.0909220.0487691.885526-2.7219432.457660-0.2869730.206573
6624021999041113.040.00.01.015015.00.0307320160317350046.1043354.9262190.1133111.644606-1.2703810.2679980.1176750.1423340.0254460.028174-4.9022001.610616-0.834605-1.996117-0.103180
771653461999070626.0141.00.00.010115.00.0400020160326100042.255586-3.167771-0.6766931.9426730.5242060.2395060.0000000.1229430.0398390.0824133.693829-0.245014-2.1928100.2367280.195567
8829742003020519.012.01.01.017915.00.0467920160326285046.0848884.8937170.4753330.556575-1.2624900.2638330.1165830.1442550.0398510.024388-4.9252341.5877960.075348-1.5510980.069433
9982021199801017.075.00.00.08815.00.03022016040265043.0746261.666386-2.2015453.0968610.8438520.2624730.0682670.0121760.0102910.098727-1.0895840.600683-4.1862100.198273-1.025822

Last rows

SaleIDnameregDatemodelbrandbodyTypefuelTypegearboxpowerkilometernotRepairedDamageregionCodecreatDatepricev_0v_1v_2v_3v_4v_5v_6v_7v_8v_9v_10v_11v_12v_13v_14
149990149990346612001120741.061.00.00.0608.00.060682016033145041.583949-3.180839-0.9795533.0723641.8357410.2297190.0000000.1156740.0266960.1236344.0844780.243821-3.5052391.2857900.856962
149991149991549592012040147.015.00.01.02115.00.04225201603212495048.7281805.6140350.703266-3.768265-1.3472440.2792640.1259220.0688980.0953200.012592-7.088874-0.4253714.044331-0.6338290.586724
1499921499921834992000120632.081.00.00.08215.00.051022016030995041.993906-3.126642-0.7958361.8089471.2032260.2347360.0000000.1058340.0420960.1024353.735963-0.176973-2.3532030.998859-0.085879
1499931499937208720041103184.0110.01.00.01402.00.051920160324439944.175418-3.045687-0.886965-1.3924490.4484690.2553490.0004130.0414390.0824650.0742472.374701-2.1962980.6429200.900097-0.721607
149994149994430732012010742.011.00.00.01223.00.05053201603241478047.0551214.7332811.851484-1.8850761.2199550.2351290.1146280.1703230.0803250.091744-5.1382971.4424912.8125790.951352-1.600794
14999514999516397820000607121.0104.00.01.016315.00.0457620160327590045.316543-3.139095-1.269707-0.736609-1.5058200.2802640.0003100.0484410.0711580.0191741.988114-2.9839730.589167-1.304370-0.302592
14999614999618453520091102116.0110.00.00.012510.00.0282620160312950045.972058-3.143764-0.023523-2.3666990.6980120.2532170.0007770.0840790.0996810.0793711.839166-2.7746152.5539940.924196-0.272160
1499971499971475872010100360.0111.01.00.0906.00.0330220160328750044.733481-3.1057210.595454-2.2790911.4236610.2333530.0007050.1188720.1001180.0979142.439812-1.6306772.2901971.8919220.414931
149998149998459072006031234.0103.01.00.015615.00.0187720160401499945.658634-3.204785-0.441680-1.1798120.6206800.2563690.0002520.0814790.0835580.0814982.075380-2.6337191.4149370.431981-1.659014
1499991499991776721999020419.0286.00.01.019312.50.023520160305470045.536383-3.200326-1.612893-0.067144-1.3961660.2844750.0000000.0400720.0625430.0258191.978453-3.1799130.031724-1.483350-0.342674